A Unit Selection-based Speech Synthesis Approach for Chinese Mandarin Text-to-Speech

نویسنده

  • Dong Minghui
چکیده

The paper presents a unit selection-based speech synthesis approach for Chinese Mandarin. Unit selection-based approach generates speech by directly connecting pre-recorded speech units. In this approach, a corpus is used as a source unit inventory. A feature vector is defined to describe each unit. To generate speech, the feature vector of the target unit is first calculated. During synthesis, we select units by considering the following: (1) each unit to be selected should have a proper prosody property (2) the adjacent units should be smoothly connected. To find the best unit sequence, Viterbi search algorithm is used in this approach. Prosody is one of the important concerns in speech synthesis. To generate natural speech, prosody should be carefully considered in unit selection process. In this paper, the authors use a prosody description to generate speech with good prosody. Experiment shows that this approach generates very high quality speech.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

A Unit Selection-based Speech Synthesis Approach for Mandarin Chinese

The paper presents a unit selection-based speech synthesis approach for mandarin Chinese. Unit selection-based approach generates speech by selecting proper units from a speech corpus and connecting them together. In this approach, a set of features are defined to describe the speech units in the corpus and the expected units in the synthesized utterance. Based on the features, cost function is...

متن کامل

Issues in Text-to-Speech Conversion for Mandarin

Research on text-to-speech (TTS) conversion for Mandarin Chinese is a much younger enterprise than comparable research for English or other European languages. Nonetheless, impressive progress has been made over the last couple of decades, and Mandarin Chinese systems now exist which approach, or in some ways even surpass in quality available systems for English. This article has two goals. The...

متن کامل

The WISTON Text to Speech System for Blizzard Challenge 2010

The paper introduces the speech synthesis system developed by Institute of Automation, Chinese Academy of Sciences(CASIA) for Blizzard Challenge 2010. The large corpus based speech synthesis system, WISTON, was built to synthesize Mandarin speech. In this year, a new prosodic structure prediction model was used, which is more precise and compact than before. Furthermore, two kinds of syllable s...

متن کامل

Hierarchical non-uniform unit selection based on prosodic structure

In speech synthesis systems based on wave concatenation, using longer units can generate more natural synthetic speech. In order to improve the usage of longer units in the corpus, this paper proposed a hierarchical non-uniform unit selection framework. Each layer included in the framework is an independent searching procedure which searches for different sized units and adopts suitable natural...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005